Weilin Zhao

Spava: Accelerating Long-Video Understanding via Sequence-Parallelism-aware Approximate Attention

Jan 29, 2026
Via arXiv

The Flexibility Trap: Why Arbitrary Order Limits Reasoning Potential in Diffusion Language Models

Jan 21, 2026
Via arXiv

MiniCPM4: Ultra-Efficient LLMs on End Devices

Jun 09, 2025
Via arXiv

Speculative Decoding Meets Quantization: Compatibility Evaluation and Hierarchical Framework Design

May 29, 2025
Via arXiv

FR-Spec: Accelerating Large-Vocabulary Language Models via Frequency-Ranked Speculative Sampling

Feb 20, 2025
Via arXiv

APB: Accelerating Distributed Long-Context Inference by Passing Compressed Context Blocks across GPUs

Feb 17, 2025
Via arXiv

Densing Law of LLMs

Dec 05, 2024
Via arXiv

Enabling Real-Time Conversations with Minimal Training Costs

Sep 18, 2024
Via arXiv

Configurable Foundation Models: Building LLMs from a Modular Perspective

Sep 04, 2024
Via arXiv

MiniCPM-V: A GPT-4V Level MLLM on Your Phone

Aug 03, 2024
Via arXiv